Semi-automatic phonemic labelling of speech data using a self-organising neural network
نویسنده
چکیده
In the perspective of assessing existing and new speech input and output devices, the development of methods for computerised semi-automatic labelling of speech data is becoming increasingly important. This paper describes preliminary work to achieving this goal by utilising Neural Network technique to perform phonemic classification. The speech data used for Neural Network learning are taken from the SAM-EUROM.O database, which has been manually labelled by expert phoneticians as a reference. Separate data from 1 male speaker are used to test the performance of the semiautomatic phonemic labelling system. The preliminary results show an average classification rate of 65 % an the vocalic and 72% an the consonantal Danish archiphonemes.
منابع مشابه
Pseudo-articulatory speech synthesis for recognition using automatic feature extraction from x-ray data
We describe a self-organising pseudo-articulatory speech production model (SPM) trained on an X-ray microbeam database, and present results when using the SPM within a speech recognition framework. Given a time-aligned phonemic string, the system uses an explicit statistical model of co-articulation to generate pseudoarticulator trajectories. From these, parametrised speech vectors are synthesi...
متن کاملSpeech Processing Using Artificial Neural Networks
A three layer perceptron network is used to classify the /i/ sound using isolated words from diierent speakers. A classiication accuracy of 97% has been achieved. A map of phonemes is used to trace trajectories of utterances using the self-organising neural network. A crinkle factor is proposed which allows using the self-organising map to determine the inherent dimensionality of a set of point...
متن کاملA Comparative Study of the Multi-Layer Perceptron, the Multi-Output Layer Perceptron, the Time-Delay Neural Network and the Kohonen Self-Organising Map in an Automatic Speech Recognition Task
This paper describes a study of the use of four different neural network techniques for automatic speech recognition (ASR) using two common, real-world application databases. The neural network techniques investigated were the Multi-Layer Perceptron (MLP), the Multi-Output-Layer Perceptron (MOLP), which is an improved version of the MLP , the Time-Delay Neural Network (TDNN) and the Kohohnen Se...
متن کاملVisualising Class Distribution on Self-organising Maps
The Self-Organising Map is a popular unsupervised neural network model which has been used successfully in various contexts for clustering data. Even though labelled data is not required for the training process, in many applications class labelling of some sort is available. A visualisation uncovering the distribution and arrangement of the classes over the map can help the user to gain a bett...
متن کاملAutomatic Pronunciation Generation by Utilizing a Semi-Supervised Deep Neural Networks
Phonemic or phonetic sub-word units are the most commonly used atomic elements to represent speech signals in modern ASRs. However they are not the optimal choice due to several reasons such as: large amount of effort required to handcraft a pronunciation dictionary, pronunciation variations, human mistakes and under-resourced dialects and languages. Here, we propose a data-driven pronunciation...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1989